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Bateman's experimental study of 
Drosophila melanogaster produced 
conclusions that are now part of the bed- 
rock premises of modern sexual selection. 
Today it is the most cited experimental 
study in sexual selection, and famous 
as the first experimental demonstration 
of sex differences in the relationship 
between number of mates and relative 
reproductive success. We repeated the 
experimental methodology of the origi- 
nal to evaluate its reliability. The results 
indicate that Bateman's methodology of 
visible mutations to assign parentage and 
reproductive success to subject adults is 
significantly biased. When combined in 
offspring, the mutations decrease off- 
spring survival, so that counts of mate 
number and reproductive success are 
mismeasured. Bateman's method overes- 
timates the number of subjects with no 
mates and underestimates the number 
with one or more mates for both sexes. 
Here we discuss why Bateman's paper is 
important and present additional analy- 
ses of data from our monogamy trials. 
Monogamy trials can inform inferences 
about the force of sexual selection in 
populations because in monogamy tri- 
als male-male competition and female 
choice are absent. Monogamy trials also 
would have provided Bateman with an a 
priori test of the fit of his data to Mendel's 
laws, an unstated, but vital assumption 
of his methodology for assigning par- 
entage from which he inferred the num- 
ber of mates per individual subject and 
their reproductive success. Even under 
enforced monogamous mating, offspring 



frequencies of double mutant, single 
mutant and no mutant offspring were 
significantly different from Mendelian 
expectations proving that Bateman's 
method was inappropriate for answering 
the questions he posed. Double mutant 
offspring (those with a mutation from 
each parent) suffered significant invi- 
ability as did single mutant offspring 
whenever they inherited their mother's 
marker but the wild-type allele at their 
father's marker locus. These inviability 
effects produced two important inaccu- 
racies in Bateman's results and conclu- 
sions. (1) Some matings that actually 
occurred were invisible and (2) reproduc- 
tive success of some mothers was under- 
estimated. Both observations show that 
Bateman's conclusions about sex differ- 
ences in number of mates and reproduc- 
tive success were unwarranted, based on 
biased observations. We speculate about 
why Bateman's classic study remained 
without replication for so long, and we 
discuss why repetition almost 60 years 
after the original is still timely, necessary 
and critical to the scientific enterprise. 
We highlight overlooked alternative 
hypotheses to urge that modern tests of 
Bateman's conclusions go beyond confir- 
matory studies to test alternative hypoth- 
eses to explain the relationship between 
mate number and reproductive success. 

The Paper that Shaped Studies 
in Sexual Selection 

A.J. Bateman's experimental study of 
Drosophila melanogaster 1 has had a huge 
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impact on empirical and theoretical 
studies in modern sexual selection. 2 " 7 
Bateman's conclusions are now part of the 
bedrock premises of modern sexual selec- 
tion and they are foundational ideas in 
the evolutionary study of sex differences. 2 
Bateman's results were that (1) male num- 
ber of different mates (NM) was greater 
than female NM, (2) males had greater 
variation in number of adult offspring, 
i.e., "reproductive success" (RS) than 
females, and (3) because of the number 
of mates, which Bateman inferred from 
plots of RS against NM (now known as 
"Bateman gradients"). Bateman said that 
his data showed that sex difference in the 
variance in number of offspring was the 
sign (italicized in the original) of "intra- 
masculine selection" while the cause of 
selection among males was a "stronger 
correlation" in males between NM and 
RS. His conclusions were consistent with 
Darwin's discussion 8 of the evolution of 
elaborate male traits via female choice and 
male-male competition. 

In Bateman's time it was impossible 
to do the carefully controlled parentage 
assignment study he attempted on any 
organism other than D. melanogaster. 
The work of the great early 20 th century 
drosophilist, Thomas Hunt Morgan, 
provided the necessary tools when he 
discovered, cultured, and maintained 
lines of flies with spontaneously arising 
and phenotypically dramatic mutations 
and then explored Mendelian inheritance 
patterns according to the basic rules of 
Mendelian genetics (see ref. 9). Bateman's 
study simply could not have been imag- 
ined before there was a solid understand- 
ing of the origins and inheritance of 
phenotypically-obvious mutations in an 
experimentally tractable model organism, 
putting Bateman's "discoveries" in the 
very large bin of experimental firsts with 
D. melanogaster. 

Trivers's rediscovery 10 of Bateman's 
previously little-cited paper on D. mela- 
nogaster 1 propelled Bateman's inferences 
onto center stage in studies of sexual 
selection and sex differences. Deservedly 
so, as it was the first study that seemed to 
experimentally demonstrate what Darwin 
hypothesized about the force of female 
choice and male— male combat 8 on varia- 
tion among male rivals in their number 



of mates. It was the first experimental 
study to conclude what Fisher proposed: 
a greater variance in fitness among males 
associated with sexual selection on males 
(measured as number of eclosed off- 
spring). 11 Until 2007, 12 it was unrecog- 
nized that the paper also provided the first 
experimental demonstration that mat- 
ing with more than one male enhances 
females' number of offspring. Bateman 
claimed he had confirmed Darwin's 
ideas about the force of female choice 
and male-male competition 1 on male 
fitness variance. 8 He said he had shown 
"....that sexual selection is more effective 
in males than females," 1 (p. 363) which he 
argued was consistent with the idea that 
males are not as discriminating in mating 
behavior as females, an argument that was 
interpreted for a very long time as though 
there was no evidence for fitness payouts 
to females who mated with more than one 
male or for the fact that males are choosy 
too. Bateman also argued that egg num- 
ber limits female reproduction, which ". . . 
causes a severe strain on their nutrition" 
so that reproductive capacity limits female 
reproductive success, but sperm limita- 
tion was unlikely to limit male reproduc- 
tion. In other words, Bateman argued 
that resources limit females, but that 
females limit males. "With intramascu- 
line selection males will be expected to show 
polygamous tendencies whereas in females 
there would be selection in favour of obtain- 
ing only one mate after which they would 
become relatively indifferent" (p. 367). 
These ideas are ones that took hold and 
spread. 

The parental investment (PI) paper 
of Trivers 10 inspired a generation when it 
linked sex differences in PI to the evo- 
lution of genes for choosy behavior in 
females and indiscriminate behavior in 
males. Among the most important results 
of the PI idea was its prediction that when 
PI is greater in males, males would be "the 
choosy sex" and females the indiscrimi- 
nate sex. 

After Trivers, Arnold seized on 
Bateman's ideas and called the results 
"principles": 2 truths to count on or at 
least important assumptions to test. In 
a nutshell, what Bateman did with the 
help of these authors was anchor within- 
sex variance in number of mates (V.,. .) as 



the key predictor of variance in reproduc- 
tive success (V RS )," which is sometimes 
considered the only measure of sexual 
selection. Bateman's paper is part of the 
standard historical trajectory in the con- 
troversial paradigm of the origins of sex 
differences. 3,7 

Why did We Replicate 
Bateman (1948)? 

Given that replication is a pillar of the 
scientific method, the previous lack of 
strict repetition seemed odd to us. A few 
previous investigators had evaluated and 
questioned Bateman's methods, 12 his 
results, 7 the sweep of his conclusions 3 
and the implications others took from 
his study. 3 ' 7,13 In particular, we won- 
dered if the obvious failure of data in 
Bateman's Table 4' (reproduced in Fig. 1) 
to match Mendel's expectations was a 
systematic problem associated with all of 
his populations. We wondered: Would a 
strict repetition of Bateman's methods 
calm our concerns and those of earlier 
critics? 

We also wondered if other explanations 
might explain the key results as well as or 
better than sexual selection seemed to do, 
so we planned to test the results against 
the predictions of sexual selection and 
stochastic demography (i.e., chance varia- 
tion in individual survival and/or chance 
encounters with potential mates). 14,15 

In addition, we were interested in fol- 
lowing up the questions about the valid- 
ity of Bateman's method that arose when 
we 12 were reading his paper to ensure that 
our repetition was as faithful as possible to 
its methodology, which is what we mean 
by "strict." Our reading identified several 
problems with the methodology. The most 
important was obvious from Bateman's 
Table 4 (Fig. 1), which he included as an 
example of how he inferred numbers of 
mates and number of offspring per sub- 
ject. The frequency of offspring inherit- 
ing and phenotypically expressing the 
dominant marker gene from each par- 
ent was significantly lower than the Va 
that Mendel's rules require (Fig. 2), sug- 
gesting that in combination, the marker 
mutations lowered offspring viability and 
potentially could have biased observations 
of NM and RS. 
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TABLE 4 

Sample result of one mating. From series 6 BxA, sixth bottle 
{CyL, Cy, Mc) X (Pm, H, Sb) 





CyL 


Cy 


Mc 


+ 


Total 


No. of mates 


Sb 


■ 13 


0 


0 


16 


29 


1 


Pm 


10 


12 




68 


105 


3 


H 


7 


29 


0 


41 


77 


2 


+ 


60 


38 


40 


1 10 


248 




Total . 

No. of mates 


9«> 

3 


79 

s 


SS 
1 


235 


459 





Figure 1. Bateman's (1948) Table 4, p. 357 (reprinted by permission from Macmillan Publishers Ltd. License Number: 2960430598991), showed the 
logic of Bateman's method for assigning parentage and thus inferring NM and RS per adult in a given population. The table is the only completely 
displayed data for any population in Bateman's study. The tabled values are the number of offspring that carried a parental marker. The only offspring 
that allowed inference of NM were the double mutants — M'M". In the table M'M" offspring include the 13 that inherited the mother's CyL dominant 
gene and the father's Sb dominant gene, and so on. To estimate the number of offspring that each mother produced, he took the sum of their M 9 M" 
+ M 9 w" offspring and for each father he took the sum of their M 9 M" + w'M" offspring. Most important, however, is that from the data in this table one 
can compute the frequencies of M'M", M'w", w'M" and w 9 w" and compare them to Mendel's expectations under the assumption that each adult was 
a dominant heterozygote at a unique locus and homozygous wild type at every other adults marker loci. The values are a significant departure from 
Mendelian expectations with M'M" significantly fewer than 25%.' 2 



"Repetitions" using modern molec- 
ular genetic studies, of course, exist. 
However, these are not repetitions 
in the strict sense, but rather stand- 
alone tests of "Bateman's principles." 
Tang-Martinez 16 has recently reviewed 
these studies and shown that while many 
studies confirm Bateman's principles, 
others do not. Studies reporting results 
inconsistent with Bateman's principles 17 " 20 
and the many studies showing a benefit 
of polyandry 21 " 24 for females implicitly 
reject Bateman's conclusion that selec- 
tion does not favor female multiple 
mating. The studies that demonstrated 
consistency with Bateman's princi- 
p| es 4,6,25-28 pjq Uec j our interest further, as 
alternative explanations for Bateman's 
observations besides sexual selection 
have also been available for at least 
25 years; yet few workers have tested 
their observations against these alter- 
natives. 15,25 A repetition could have 
put our minds to rest about the valid- 
ity of Bateman's conclusions and might 
have provided an unbiased evaluation 
of mate number and reproductive suc- 
cess that would allow robust tests of the 
currently obvious alternatives. 15,29 It 
seemed to us way past time to repli- 
cate Bateman's study using his original 
methodology. 



What the Repetition Discovered 

Our repetition appears to be unique 
in that we tried to replicate exactly 
Bateman's methodology of parentage 
assignment as far as we could. We used 
the same mutant lines Bateman used. In 
our study, we cultured adult subjects as 
Bateman had done, so that each expressed 
a unique genetically-determined marker 
phenotype (Fig. 3A and B). Table SI in 
our original report 30 illustrates that each 
adult subject was genetically and pheno- 
typically distinct; that is, (regardless of 
their sex) in each of our replicated popu- 
lations, each adult carried a single allele at 
its marker locus, while having only wild- 
type alleles at all other subjects' marker 
loci. We included the cultured adults in 
experimental populations in the same 
combinations of sexes, markers per sex, 
age of males and females and duration 
of the period during which mating could 
occur that Bateman reported (see Table 
SI in ref. 30). 

Following Bateman's method closely, 
we counted the number of mates using 
only the offspring with a mutation from 
each parent, the double mutant off- 
spring, M'M^s. According to Bateman, 
M 9 M tf s allowed an unbiased count of how 
many mates the subject adults had. Like 



Bateman we used the sum of the double 
mutant offspring plus the single mutant 
offspring (those with a marker mutation 
from one parent only) as an estimate of the 
number of offspring for each subject adult 
(see Fig. 1). 

Like Bateman we did not watch the 
mating behavior of our subjects, so we 
had no better estimate of who mated with 
whom than Bateman did. Thus, the verac- 
ity of Bateman's study and our replication 
turned on the "fairness" of the parental 
markers: Were they neutral with respect 
to Bateman's goals of inferring number of 
mates and number of offspring for each 
subject? This question is one that mod- 
ern forensic scientists and those studying 
genetic parentage also must do and do in 
fact address. If markers fail to fit Mendel's 
rules or Hardy-Weinberg expectations or 
are otherwise non-neutral, modern geneti- 
cists exclude particular loci from use in 
their studies of forensics, kinship, and 
genetic parentage. 31,32 Bateman-era geneti- 
cists would have used the simple expecta- 
tions from a consideration of Mendelian 
principles to test the fairness of the mark- 
ers (Figs. 2 and 3): Bateman did test if 
half the offspring could be assigned to 
mothers and half to fathers, and concluded 
that there was no statistical difference 
in the representation among offspring of 
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Figure 2 (see previous page). A cartoon of the even frequencies of offspring phenotypes when parents are unique heterozygote dominants each at 
a unique locus. Drosophilist Sergio Castrezana, PhD, painted the image styled as Mayan-like hieroglyphs in a Mexican bark painting in black ink and 
acrylic on amate paper. 
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Figure 3. A stylized view of offspring genotypes (A) and observable offspring phenotypes (B) 
when each parent is a heterozygote dominant at a unique marker locus and wild type at the other 
parent's marker locus. In both panels capital letters indicate dominant alleles and lower case 
indicates wild-type alleles for two parents each with a dominant marker allele each at a differ- 
ent locus. The male's maker locus is indicated by "B" and the female's by "R." Wild-type alleles at 
mother's marker locus are indicated by lowercase, "r" and at father's marker locus by lowercase 
"b." In (B) the bolded letters indicate visible mutations. 



mothers' vs. fathers' marker phenotypes. 
However, if he tested whether the fre- 
quency of double mutant offspring was 
unbiased, he did not report it. It appears 
he did not test his data against Mendel's 
expected frequencies as his Table 4 (Fig. 1) 
data clearly significantly depart from 
Mendelian frequencies. Such a test of the 
frequency of adult markers in offspring is 
the decisive one required for demonstra- 
tion that estimates of number of mates per 
individual and the V.... were unbiased, 
fairly representing who did and did not 
mate. 

Assuming Mendelian inheritance of 
alleles when each parent is heterozygous 
dominant at unique loci (Fig. 1), there 
should be 25% of offspring with both 
parental mutations, 25% with the domi- 
nant allele at mother's marker locus but a 
wild-type allele at father's marker locus, 
25% with the wild-type allele at mother's 
marker locus and the dominant allele at 
father's marker locus and 25% with the 
wild-type allele at each of their parents' 
marker loci (Figs. 2 and 3). We gen- 
eralize these types of offspring pheno- 
typically as M 9 M S , MV, w'M a and 
w 9 w a , respectively. The frequency of 
M'M" offspring was significantly lower 
than expected overall. 30 The fatal flaw 
that our repetition revealed is that the 
method miscounts the number of mates 
for each sex — key variables in Bateman's 
study — to an unknown degree, because 
in Bateman's study the only information 
about who mated with whom was from 
the phenotypes of the M'M" offspring. 
Using Bateman's method we coded some 
subjects as having zero mates, when they 
in fact had one or more mates, which 
was clear when we observed "zero mated 
subjects" whose phenotypes appeared in 
their single mutant — M 9 w tf and w'M a — 
offspring (Fig. 4). Thus the method 
of assigning parentage miscounted the 
number of subjects with 0, 1, 2, 3, 4 
or 5 mates. The error in counts of zero 
mates was systematically larger for males 
than for females as Figure 4 shows. Thus 
counts of NM were inappropriate, and 
by necessity the estimates of V NM were 



inaccurate and perhaps misleading to an 
unknown extent. 

The second flaw arose because sig- 
nificantly more single mutant offspring 
survived when they had their mother's 
wild-type allele and their father's mutant 
allele (that is, the w'M^s) than offspring 
with their mother's marker allele and 
their father's wild-type allele (the M'w^s), 
the sex differences in reproductive suc- 
cess were also biased, showing higher RS 
for fathers than for mothers. Although 



Bateman did report that the numbers of 
offspring from M ! w" and w'M" were 
about equal and his Table 4 (Fig. 1) 
suggested statistical equality in RS for 
assigned parents, he did not report the 
frequency of M'M^s or the test for paren- 
tal equality of RS in his entire experi- 
ment. Had he reported a test of observed 
frequencies of M'M 17 , M ! w", w'M" and 
wV against Mendel's expectations 
of M 9 M S , MV, w 9 M J and wV, he 
would have provided the information that 
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Figure 4. Subjects seemingly without mates had offspring, which is highly unlikely in a sexually reproducing diploid species. Reproductive success 
counted as the sum of M'M" plus M'w" for female subjects (A) and as M 9 M" plus w'M" for male subjects (B) against the number of mates counted 
from M 9 M ff offspring for females (A) and for males (B) exposes a biological impossibility. Bateman's method overestimates the number of individuals 
with zero mates (21 subjects among females and 43 among males), thereby underestimating the number with one or more mates. The magnitude of 
error among male subjects (43 males out of a total N of 166 males) was greater than among female subjects (21 females out of a total N of 166 females), 
which would have falsely increased the V NM estimates of males relative to females. We use these plots to illustrate one of the most egregious errors in 
Bateman's method: subjects who seemed to have no mates had offspring. 



his contemporaries and modern readers 
needed to evaluate the reliability of his 
markers given his questions. Although it 
appeared to be "cutting edge" in its day, 
the methodology of Bateman's study 
appears to have been fatally flawed. 

Our replication showed that using 
Bateman's method produced two obser- 
vations that are biologically impossible 
or at least extremely unlikely. Figure 4 
shows the first by examining inferences 
about NM and RS for the 166 female 
subjects (A) and 166 male subjects (B) 
in the replication. The x-axis is the num- 
ber of mates for subjects counted from 
M'M" offspring, the only offspring pro- 
viding information about who mated 
with whom. The y-axis is reproductive 
success counted as the 2 = M 9 1VF + 
M 9 -w a (for female subjects, the mothers) 
or 2 = M'M" + w'lVP (for male subjects, 
the fathers). A single fact exposes the 
bias in Bateman's method: Some indi- 
viduals that we counted using Bateman's 
method as having zero mates nevertheless, 
using Bateman's method, had offspring 
(Fig. 4). The mismatch between infor- 
mation in double mutant and single 
mutant offspring was due in our experi- 
ment to significantly lower viability of 
M ! M" offspring. Remember that single 



mutant offspring inform questions about 
how many offspring each subject had, 
but they are silent about who mated 
with whom. Because of the inviability 
of double mutant offspring, some adults 
were scored as having zero mates, when 
in fact that had some unknown number 
of mates. The resultant miscount incor- 
rectly increased the number of males in 
the class having zero mates, and the num- 
ber with > 1 was consequently underesti- 
mated. Using the method Bateman used, 
Figure 4 shows that Bateman would have 
incorrectly assigned many more males 
than females to the zero class of num- 
ber of mates, so that the effect would 
be to increase the male V.„, relative to 

NM 

the effect on female V XIW . There is no 

NM 

way to know what effect the incorrect 
assignment of subjects to the zero mating 
class has on the under-estimation of the 
subjects in the classes with > 1 mate 
(Fig. 4). 

The second biological impossibility 
that our replication revealed was that there 
was a systematic bias in counts of offspring 
with fathers vs. mothers. That is, using 
Bateman's method we were able to assign 
genetic paternity significantly more often 
than genetic maternity, making it seem as 
though more offspring had fathers than 



mothers, verifying that there was some- 
thing wrong with the methodology, just as 
we 12 had earlier suspected. More paternal 
assignments occurred because the num- 
ber of w'M" offspring was greater than 
the number of M'w"' offspring. We tested 
for significance differences with a plot of 
the difference scores in the apparent num- 
ber of offspring with fathers minus the 
apparent number with mothers (see Fig. 1 
in ref. 30). Because the bias resulted in 
assigning more fathers than mothers as 
parents, the estimate of RS was greater 
for fathers than mothers, demonstrably 
because of mismeasurements derived from 
Bateman's method: single mutant off- 
spring who inherited the dominant allele 
from their mother's marker locus plus a 
wild-type allele at their fathers had lower 
viability than offspring who inherited a 
dominant allele from their father's marker 
locus but a wild-type from their mother's 
marker locus. Systematic, methodological 
mismeasurement of number of mates and 
number of offspring leaves open to ques- 
tion Bateman's conclusion that there is 
an enhanced effect of RS as a function of 
NM for males, but not females. Because 
of the bias in the methodology, our rep- 
etition could not address the main ques- 
tions that Bateman set out to answer. If his 
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Figure 5. Frequencies of M 9 M", M 9 w", w'M" and w'w" offspring from monogamy trials (five sets for each parental marker combination). The column 
on the far left identifies the offspring types: White bars = IvI'M*, light gray bars = M'w", dark gray bars = w'M" and black bars = wV. 



overall data, not just the data in his Table 4 
(Fig. 1), were inadequate to his questions 
as our repetition suggests, his answers to 
his questions would be unreliable as well. 

M 9 I\T Offspring 
from Monogamous Pairs 
also were Inviable 

A control that was available to Bateman, 
but that he did not report having done was 
to test the frequencies of M 9 M CJ , M'w 3 , 
w'M 3 and w'w" in trials of monogamous 
pairs representing each possible combina- 
tion of parents. Such a test a priori would 
have informed Bateman of the fairness of 
the markers as unbiased indicators of par- 
entage. Notably because sexual selection 
is absent in strict monogamy, it would 
also have provided a strong comparative 
test of the force of sexual selection in his 
populations. 



We reported monogamy trials in our 
original report along with the tabled 
counts and frequencies of M'M", M ? w", 
w'M" and w'w 3 for each combination of 
heterozygote dominant female and male 
(see Table S4 in ref. 30). We showed that 
even under monogamy that only four out 
of 25 of the pair combinations (each with 
five trials) had M'M" at or near 25% (see 
tabled data, Table S14 in ref. 30). The vast 
majority were under 25%. Figure 5 shows 
graphically the distribution of M ! M", 
M'w 3 , w'M" and w'w" of offspring for 
each set of pairs in the monogamy trials 
that included analysis of 12,005 offspring 
phenotypes. Overall M'M 3 were signifi- 
cantly less frequent than the Mendelian 
expected frequency of 25% . Had Bateman 
performed trials of this sort, he would 
have known that his methodology — at 
least with the mutations he used as marker 
"nametags" for his potentially breeding 



subjects — was inappropriate for and inca- 
pable of answering his questions about 
who mated with whom, which provided in 
his experiment the only data on the num- 
ber of mates for each subject. 

In addition, had Bateman performed 
monogamy trials, he a priori could have 
observed that his methodology — at least 
with the mutations he used — was also 
inappropriate for evaluating reproductive 
success. Consider that counts of M'M" 
offspring are necessarily equal for moth- 
ers and fathers. However, counts of M'w" 
and w 9 M tf may differ. The question then 
arises: Are counts ofM V and w 9 M (J sig- 
nificantly different from 50% of offspring 
with fathers and 50% with mothers? A 
simple way to answer this question is in 
Figure 6 that shows the frequency of dif- 
ference scores for numbers of M'w' 5 ' and 
w in each set of the monogamy trials. 
Single mutant offspring reveal only one of 
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Figure 6. The difference in the number of offspring assigned to mothers and fathers computed from M 9 w" and w 9 M" offspring that individual adults 
confined in monogamous pairs produced. There were five trials of monogamous pairs in each parental combination of unique markers. The distribu- 
tion of difference scores was significantly different from 0 (t-test = -3.7004, DF = 24, p > |t| = 0.0011). 



their parent's identities; thus on average 
for each parental set in the monogamy tri- 
als averaged over the five trials per set, we 
expect there would be no systematic dif- 
ferences in the number of offspring that 
were scored as having mothers and fathers. 
Figure 6 shows that more offspring appar- 
ently had fathers than had mothers, which 
is a biological impossibility. The reasons 
for the bias in the inferences appear to be 
that in M V offspring, the father's wild- 
type alleles may fail to rescue offspring 
from the deleterious effects of the moth- 
er's marker allele. Alternatively, in w'JVF, 
the mother's wild-type alleles may have 
rescued offspring from the deleterious 
effects of the father's marker allele. This is 
speculation, of course. There is little way 
to know what explains the apparent dif- 
ference in the number of offspring with 
fathers compared with the offspring with 
mothers without additional experiments 
on the effects of the mutations singly and 
in combination. 

In sum, the monogamy trials did reveal 
that the marker genes Bateman used were 
deleterious to offspring when inherited 
together. It is worth noting that half of the 
markers Bateman used and that we used 



were homozygous lethal (see Table 1 in 
Bateman 1 and Table 1 in ref. 30); however, 
no potential parent in either Bateman's or 
our experiment was a homozygote for their 
marker allele. More important, because 
each parent had a unique marker locus, no 
offspring could be homozygous dominant 
for any single one of the parental marker 
loci. Thus, the deleterious effects on off- 
spring viability are from inheritance of 
the parental marker alleles in combination 
(Fig. 5). 

What can the Replication 
Say about the Original Study? 

What can we make of Bateman's original 
study and his claims given our study? First 
and most important, if throughout his 
study double mutant offspring occurred 
in significantly lower frequencies than 
25% as they did in the one population for 
which he displayed all the offspring phe- 
notypes (Fig. 1), his entire study, not just 
one population, would have been biased. 
Our replication emphasizes the likelihood 
of that possibility. 

The replicated results show that in 
all but one of our 46 populations M 9 M a 



offspring were significantly less frequent 
than expected. If that also were true for 
Bateman's original study, Bateman too 
would have overestimated the number of 
adults that failed to mate, underestimated 
the number who mated one or more times 
and perhaps may also have assigned par- 
entage to more fathers than mothers, 
a biological impossibility in species in 
which offspring have both a mother and 
a father. If Bateman's method was flawed 
as our repetition suggests it was, he had 
no valid evidence for sex differences in 
numbers of mates or reproductive success, 
much less for V NM and V RS . In keeping 
with Bateman's method, because we did 
not watch behavior, we have no basis for 
claiming that males mated indiscrimi- 
nately or that females were indifferent to 
mating more than once. 

Critics may claim that the mutant lines 
we used may have undergone significant 
evolutionary change since Bateman's day 
that affected the results. That, of course, is 
a possibility, yet the hypothesis that guided 
our research was the evidence in Bateman's 
own study (Fig. 1) that double mutant off- 
spring were significantly less frequent than 
Mendel's expected frequency of V4. Our 
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results are very like the results in the only 
bit of data in Bateman's study (Fig. 1) that 
allowed a test against Mendel's expecta- 
tions. And, it remains the case that with- 
out a time machine, it would be difficult 
for anyone today to replicate Bateman's 
method. We think we did the best that 
anyone could do by using the lines of flies 
Bateman used that are still available today. 

Thus, our repetition raises potentially 
unanswerable questions about Bateman's 
original studies. Were the double mutant 
offspring in his experiment as likely to die 
as in the repetition and as frequently as 
the offspring must have done in his Table 
4 (Fig. 1) population? Bateman did seem 
to assign more offspring to fathers than to 
mothers in his overall study; 12 however, 
the investigators reporting that observa- 
tion worked only from Bateman's pub- 
lished data, which Bateman included in 
summary tables that lacked some details. 
These are questions that only a time 
machine can conveniently answer now. 
And, perhaps that is the main value of our 
repetition: It may tell us all that we can 
know now about Bateman's original study. 

Given that Bateman did his study 
60 years ago and given that its citation 
frequency soared in the mid-1970s, it 
remains curious that it was not repeated 
post-1972. We continue to wonder why. 
Were Bateman's results so intuitive and 
comfortable that investigators didn't think 
to retest it? It is true that textbook authors 
"simplified" a description of Bateman's 
gradients leaving out more than half 
the information available, 3,7 something 
that could have misled students, increas- 
ing their "comfort zone" with the mate- 
rial, but surely could not account for the 
lack of vigilance of professionals. Was a 
repetition just too hard to do? We admit 
that it was a challenging, complex, time- 
consuming experiment to complete, so we 
can imagine that others began but did not 
complete a repetition. We also wonder if 
perhaps other strict repetitions do exist 
but remain unpublished. Are there strict 
replications whose authors never offered 
them up for publication? Are there strict 
replications that were rejected? That too 
may be a question whose answer is lost to 
time. But, the unanswered questions con- 
tinue to beg why our field did not repeat 
the classic sooner. 



Our field might profitably do some 
soul-searching: Why were Bateman's 
obvious errors overlooked for so long? As 
we said 30 in our primary report, legions 
of graduate students have for the past 40 
years read and discussed Bateman. Why 
did they not bring attention to the errors? 
Surely all of them, among biologists at 
least, understand the elements of muta- 
tion, inheritance and Mendelian genetics. 
Why did their professors not challenge 
Bateman's results? We are inclined to the 
idea that Bateman's results and conclu- 
sions are so similar to status quo, domi- 
nating world-views (competitive males, 
dependent females) that pre-existing cul- 
tural biases of readers may have damp- 
ened skepticism and objectivity. Perhaps 
lack of repetition is simply due to lack of 
professional incentives such as funding for 
repetitions. 

Why does the Repetition 
Challenge "the Paradigm"? 

It seems that the modern bedrock of sex- 
ual selection may have been quicksand. 
But does a single replication of an impor- 
tant study invalidate an entire field? We 
do not think so. 

Yet in the company of other challenges 
to the competitive male and discriminat- 
ing female paradigm, the repetition gives 
pause, 33 particularly in light of the "prob- 
lems with paradigms." 7 ". . .paradigms act 
as a "lingua franca" that facilitates commu- 
nication among scientists. On the negative 
side, paradigms can lead to simplification, 
can blind us to phenomena that do not fit 
the accepted world-view, can guide us to 
accept hypotheses that are unfounded, and 
can prevent us from considering alternative 
hypotheses and explanations. In such cases 
paradigms become dogma and have detri- 
mental effects on the development of scien- 
tific ideas" 1 (p. 821). Perhaps we should 
routinely urge our students to challenge 
paradigmatic material. Perhaps we need 
new rules such as "if it's intuitive, test it." 
Are we as a field asking the right ques- 
tions? For example, could both males 
and females in most species assess oth- 
ers' quality as mates before accepting or 
rejecting potential mates, 14,34 as happens 
in D. pseudoobscuraV''' Could males and 
females in all species be simultaneously 



choosy and competitive? 14,34 Might the 
mechanisms of sexual selection act on 
number of mates among males, but on 
quality of mates among females? 36 Or 
more radically, might both females and 
males enjoy significant fitness benefits 
through mate assessment and adaptively 
flexible mate choice based on their con- 
stantly updated predictions of the viabil- 
ity of potential offspring? 37 What exactly 
does a Bateman gradient tell about 
among-female competition over the qual- 
ity of mates? What if female and male 
reproductive competition is mediated in 
one sex via within sex variation in qual- 
ity of mates and in the other via within 
sex variation in number of mates? Would 
the metric of sexual selection then be the 
same for females and males? If females 
and males do compete over different 
things, perhaps we need a new metric for 
evaluating the effect on V RS of variance in 
mate quality (V MQ ). 

Questions about Modern Studies 
of Bateman Gradients 

A probable rejoinder from some readers 
of our recent report 30 is that the repeti- 
tion fails to weaken Bateman's principles, 
because other studies have validated them 
(see above for examples). Even in the face 
of validation from other studies in other 
species, the repetition does matter, because 
it puts in a different, more interesting 
light the published studies with results 
inconsistent with Bateman's conclusions, 
and it demands that investigators with 
results consistent with Bateman's conclu- 
sions take care of at least two imperative 
remaining concerns: (1) Observation bias 
and (2) failure to test alternative hypoth- 
eses for fitness variances. 

Genetic parentage studies allow infer- 
ences about mating systems when copu- 
lations and identification of individuals 
are hard to observe. Some, but far from 
most, modern investigators of genetic 
parentage watch copulations and there- 
fore have an independent estimate of the 
possibility that this individual mated 
with this and that potential mate. But, 
copulations that actually occur may be 
invisible to molecular genetic testing 
even when genetic markers are neutral, 
fit Hardy-Weinberg expectations, or have 
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low mutation frequencies, because sperm 
may be inviable or females might not 
use the sperm. Thus, investigators must 
watch behavior of individually identified 
subjects and simultaneously account for 
"detection bias" in the field 38,3 ' and in the 
laboratory. 

It is rarely possible to identify all 
potential breeders in a wild-living pop- 
ulation, something that challenges all 
genetic parentage studies in the field, par- 
ticularly of organisms that fly, flies and 
birds being notable examples. Laboratory 
populations have walls around them; 
wild-living populations do not. But, 
in truth it is just plain hard to identify 
individual fruit flies in a jar without 
intrusive manipulations, which could 
bias subjects' mate assessments, or con- 
tests among rivals for resources or mate 
access. Thus, it is a real possibility that 
observational biases are near ubiquitous 
in parentage studies on wild and captive 
organisms as well. We suggest caution as 
it is likely that estimates of fitness vari- 
ances are often invisibly biased, with 
consequent effects on the V NM that many 
consider the key mechanism of sexual 
selection. 
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